Scalable Detection of Emerging Topics and Geo-spatial Events in Large Textual Streams

نویسندگان

  • Erich Schubert
  • Michael Weiler
  • Hans-Peter Kriegel
چکیده

Key Ideas of our Solution • From statistics: control charts for change detection. • From computational linguistics: Analyze word cooccurrences for more meaningful results. • From mathematics: Exponentially weighted moving averages for streaming operation. • From databases: Hashing and Count-Min sketches for scalability to large data. • From data mining: Clustering of word pairs into simple “topics” based on cooccurrences. • From visualization: Word-cloud like visualization, but incorporating the relationships of words. • Integrate geographic information by mapping coordinates to tokens similar to text.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SPOTHOT: Scalable Detection of Geo-spatial Events in Large Textual Streams

The analysis of social media data poses several challenges: first of all, the data sets are very large, secondly they change constantly, and third they are heterogeneous, consisting of text, images, geographic locations and social connections. In this article, we focus on detecting events consisting of text and location information, and introduce an analysis method that is scalable both with re...

متن کامل

Spatial Semantic Scan: Detecting Subtle, Spatially Localized Events in Text Streams

Many methods have been proposed for detecting emerging events in text streams using topic modeling. However, these methods have shortcomings that make them unsuitable for rapid detection of locally emerging events on massive text streams. We describe Spatially Compact Semantic Scan (SCSS) that has been developed specifically to overcome the shortcomings of current methods in detecting new spati...

متن کامل

GeoScope: Online Detection of Geo-Correlated Information Trends in Social Networks

The First Law of Geography states “Everything is related to everything else, but near things are more related than distant things”. This spatial significance has implications in various applications, trend detection being one of them. In this paper we propose a new algorithmic tool, GeoScope, to detect geo-trends. GeoScope is a data streams solution that detects correlations between topics and ...

متن کامل

Semantic Scan: Detecting Subtle, Spatially Localized Events in Text Streams

Early detection and precise characterization of emerging topics in text streams can be highly useful in applications such as timely and targeted public health interventions and discovering evolving regional business trends. Many methods have been proposed for detecting emerging events in text streams using topic modeling. However, these methods have numerous shortcomings that make them unsuitab...

متن کامل

GeoWatch: Online detection of Geo-Correlated Information Trends In Social Networks

Detecting information trends in online social networks is an important problem that has attracted the attention of both the industry and the research community in recent years. Global trends, information items that are trendy in the entire social network, can be detected using existing data streams techniques. However, detecting global trends is only the first step in understanding online socia...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016